Yue Zhu

From Pixels to Words -- Towards Native One-Vision Models at Scale

May 27, 2026

Boiling the Frog: A Multi-Turn Benchmark for Agentic Safety

May 21, 2026

SenseNova-U1: Unifying Multimodal Understanding and Generation with NEO-unify Architecture

May 12, 2026

Asset Harvester: Extracting 3D Assets from Autonomous Driving Logs for Simulation

Apr 20, 2026

VISTA-Bench: Do Vision-Language Models Really Understand Visualized Text as Well as Pure Text?

Feb 04, 2026

IIB-LPO: Latent Policy Optimization via Iterative Information Bottleneck

Jan 09, 2026

Regularizing Subspace Redundancy of Low-Rank Adaptation

Jul 28, 2025

Expectation Confirmation Preference Optimization for Multi-Turn Conversational Recommendation Agent

Jun 17, 2025

Towards Efficient Key-Value Cache Management for Prefix Prefilling in LLM Inference

May 28, 2025

AdaptCLIP: Adapting CLIP for Universal Visual Anomaly Detection

May 15, 2025